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ABSTRACT 


A comparison was made between CDS/ISIS, its Windows version WINISIS, and 
InMagic’s INMAGIC and DB/TextWorks software. Packages were evaluated for their 
database creation, information retrieval and report production capabilities. Windows 
versions were found to provide significant enhancements over DOS versions of soft- 
ware. The evaluation aimed to determine the advantages to a developing country of cre- 
ating bibliographic databases using commercial software. 


1. Introduction 
1.1 Software categories 


A great deal of personal computer software is now available for organisation of 
information. Such software, or information management systems, usually 
comprise structured database and information retrieval components. 

A number of writers have advanced categories for such software. For exam- 
ple Burton and Petrie! differentiate between conventional database management 
systems and information retrieval systems. McCarthy? distinguishes between 
file management systems, database management systems, preconfigured data 
management systems, and text-oriented data management packages. Sieverts 
and Hofstede? suggest grouping into classical retrieval systems, end-user soft- 
ware, and ‘other’ (indexing, full-text retrieval, hypertext, personal information 
management, and best match and ranking programs). The classical retrieval sys- 
tems resemble those developed for online information retrieval systems. 
Saffady* uses the term ‘text storage and retrieval systems’ for full-text retrieval 
systems — software products that operate on the complete content of a document. 

We use the term textual storage and retrieval systems to indicate the group 
of products belonging to the classical category in the Sieverts and Hofstede 
classification. Management of structured text can be seen as a specific applica- 
tion that requires software used for this purpose to have special features. 
Structured textual data (for example, bibliographic descriptions) have charac- 
teristics, such as variable length fields and records and multiple values for 
fields, that make them difficult to manage with generic database systems such 
as Access, DBase, and Oracle with their orientation towards numerical, fixed- 
length field data. Features and capacity of text storage and retrieval systems 
vary markedly. Selection of the most appropriate software package for a par- 
ticular environment is challenging. 
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1.2 CDS/ISIS and InMagic 


We considered two Inmagic Inc. products, INMAGIC PLUS 1.0 and DB/ 
TextWorks 1.0 together with Unesco’s CDS/ISIS and WINISIS. Versions of 
these packages have been described favourably in a variety of contexts. 

The DOS version of CDS/ISIS is a well-known textual information storage 
and retrieval package in the developing countries. Kumar and Kar° reviewed 
the world-wide application of CDS/ISIS, and showed that the use of CDS/ISIS 
is an inexpensive approach to library computerisation. Sieverts et al’ when 
comparing it with eight other programs, including INMAGIC 7.2, concluded 
that despite some deficiencies, ““CDSAISIS is a very powerful and versatile pro- 
gram”. Mahmood’ considered it the best library software for developing coun- 
tries. Since the late 1980s CDS/ISIS has become one of the most popular 
packages of its kind in the world. Over 15,000 registered copies of the software 
have been distributed free-of-charge (or at cost price), in both developed and 
developing countries.’ 

INMAGIC PLUS is among the best-selling library software in the US anda 
number of other countries. Cibbarelli°? reported about 30,000 Inmagic installa- 
tions in 55 countries in 1995. It is a popular library software in law firms, uni- 
versities, government agencies and other business building text-based retrieval 
systems.'° Inmagic DB/TextWorks, a software package for Windows-based 
PCs and networks, was introduced in 1995.!'! It includes ‘relational-like’ data- 
base linking, integrated image management and other features. Costs depend 
upon type of implementation and range from US$795 for single user, 
US$2,900 for 5-user to US$4,500 for 10-user and up.!? 


1.3 Purpose of evaluation 


The evaluation was carried out in order to determine the advantages to a devel- 
oping country (one of the authors is from Vietnam) of creating bibliographic 
databases using commercial software, and the extent to which these advan- 
tages were offset by the cost of such software. It had the secondary purpose of 
comparing the utility of the packages for illustration of storage and retrieval 
principles in an instructional information management environment. 

The particular packages were examined because of their significant local 
use in the business environment (in the case of InMagic products) and educa- 
tional environment (in the case of CDS/ISIS), and because of implementations 
of CDS/ISIS in a number of countries in the South-East Asian region. 


2. Overview of the software 
2.1. CDS/ISIS 


Micro CDS/ISIS is based on the mainframe version of CDS/ISIS which was 
developed in the late 1960s. It may be regarded as a classical information 
retrieval system. Table 1 itemises some of its key features. 
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Table 1. CDS/ISIS features 


e Handling of variable length records, fields and subfields 

User-defined database with different data types to be processed for a particular 
application 

e Entering and modifying data through user-created database specific worksheets 

e Retrieval with boolean and proximity search operators as well as free-text searching 
e Sorting and report generating facility for directories, catalogues, indexes, etc. 
° 
° 





Data interchange function based on the ISO 2709 
CDS/ISIS Pascal integrated application development programming language 
allowing users to tailor the system to specific needs 

e Functions that allow users to build pseudo-relational databases 

e Multilingual package with tools for local linguistic versions 

e Utilities that allow users to create system and used-defined menus and/or worksheets 





Unesco distributes CDS/ISIS in a variety of languages including English, 
French and Spanish versions of the package and has helped in developing ver- 
sions for other languages including Arabic, Chinese and Korean. 

The first version 1.0 was released in December 1985. It is designed to run 
on IBM PC/XT with minimal (256K) RAM and hard disc requirements. It 
comprised six different programs to execute the various functions. Databases 
were able to handle up to 32,000 records. In 1989, version 2.3 was distributed, 
integrating all programs into one.'* The database capacity was extended to 
16,000,000 records, and the PASCAL programming language interface was 
introduced. Release 3.0, which supports local area networks, followed in 1992. 
The latest MSDOS version 3.07 was distributed in 1994. The Beta test version 
1.0 for Windows, WINISIS, was launched in 1996. The version we evaluated 
is the June 1997 version. 

Functions are carried out by a set of eight major services grouped into two 
categories: user services and system services.The four user services are: 


ISISENT Data entry and record editing 
ISISRET Information retrieval 
ISISPRT Printing and sorting 
ISISINV Inverted file maintenance. 


The four system services are: 


ISISDEF Database definition or modification 

ISISKCH Data exchange facilities and master file maintenance 
ISISUTL Utilities 

ISISPAS Advanced programming facility. 


A database consists of logically related but physically distinct computer files. 
File names are listed in Table 2. In the table, xxxxxx represents the name of the 
database, and yyyyyy represents user-chosen file name. A database definition 
consists of four mandatory components each stored in separate files: Field 
Definition Table (.FDT), Data Entry Worksheets (.FMT), Display/Print 
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Table 2. CDS/ISIS database files 





File name 


Notes/Functions 





XXxxxx.FDT 


PXxxxx. FMT 


pyyyyy.FMT 
XXXxxx.PFT 


yyyyyy-PFT 
XXXXxx. FST 


yyyyyy-FST 


Xxxxxx.MST 
XXXxxx.XRF 


XXxxxx.CNT 
xxxxxx.N01 
Xxxxxx.L01 
Xxxxxx.N02 
XXxxxx.IFP 
Xxxxxx. ANY 


Field Definition Table (FDT); defines the fields and their charac- 
teristics 

Data Entry Worksheets (FMT); define the screen layout used to 
create records of a database 

Display/Print Formats; define the screen display and print layout 
of records 

Field Select Table (FST); the first FST (with database name) 
defines fields to be indexed and indexing techniques for them; 
additional FSTs are used for data exchange 

Master File; a data file which contains all records of a given data- 
base 

Cross-reference File; gives the location of each record in the 
Master File; must be copied with the master file 

B*tree (search term dictionary) control file 

B*tree nodes (for terms up to 10 characters long) 

B*tree Leaf (for terms up to 10 characters long) 

B*tree nodes (for terms longer than 10 characters) 

Inverted file posting 

Any file 


XXXXxx.STW Stopwords file 





Formats (.PFT), and Field Select Table (.FST). The inverted file is an index file 
of the master file (which contains all database records). There is only one 
inverted file (.IFP) for one given database. It contains all terms that are used as 
access points during the retrieval for a given database. Information on how 
terms are selected to be included in the file is defined by FST. 

The ANY file (ANY) is an optional file that is associated with the inverted 
file. It defines ‘any’ terms, each of which is a collective name assigned to a 
group of search terms. CDS/ISIS will automatically use all terms that are 
assigned in the given ‘any’ term to form a search operation. For example 
‘ANY Southeast-Asia’ could define the name of all countries in the region 
(Indonesia, Malaysia, Thailand ...) 

The two basic components of CDS/ISIS are its menu system and work- 
sheets (System and Data Entry). Menu modification is undertaken using the 
System Utilities Services. System Worksheets are used for stipulating parame- 
ters that are needed to execute a given function, e.g. printing, data exchange. 
They are predefined but may be redefined. Data Entry Worksheets are used to 
create or modify a database record. 

The Windows version of CDS/ISIS is being developed. The June 1997 Beta 
release may be downloaded from Unesco’s Internet site. This version includes: 


— Data entry module (using existing worksheet files) 
— Print and sort modules (using existing formats) 
— A Guided Search Interface (OPAC) 


286 
Aslib, The Association for Information Management 
Staple Hall, Stone House Court, London EC3A 7PB 
Tel: +44 (0) 171 903 0000, Fax: +44 (0) 171 903 0011 
Email: pubs @aslib.co.uk, WWW: http://www.aslib.co.uk/aslib 


Program, Vol 32 No 3, 1998 
© Aslib, The Association for Information Management. 


All rights reserved. Except as otherwise permitted under the Copyright, Designs and Patents Act 
1988, no part of this publication may be reproduced, stored in a retrieval system, or transmitted 
in any form or by any means, electronic, mechanical, photocopying or otherwise without the 
prior written permission of the publisher. 


July 1998 Textual storage and retrieval software 


— Amore complete Online-Help File 
— ISIS-DOS indentation commands 
—  Internationalisation (English and Italian messages are provided). 


The policy for CDS/ISIS development includes: 


— Rewriting in C programming language in order to provide a common 
standardised language for all the versions (MSDOS, WINDOWS, 
UNIX, VAX/VMS); 

— Adopting a multi-platform software development system in order to 
increase portability; 

— Implementation of client-server architecture. 


A network of some 135 officially appointed distributors around the world has 
been established. The conditions on use of CDS/ISIS are specified in the 
Licence Agreement signed between Unesco (or the official distributor) and the 
receiving institution. 


2.2. INMAGIC 


INMAGIC is a software package developed by the Inmagic Inc.®. It includes 
necessary tools to perform database definition and maintenance, information 
retrieval, and report production. The first version was introduced in 1980 and 
was written in the FORTRAN language. In 1984, INMAGIC was rewritten in 
C and upgraded for microcomputers.'° In 1989, SearchMAGIC, a search-only 
end-user interface, was released, allowing the users to distribute INMAGIC 
databases to others. In 1992, SearchMAGIC was combined with INMAGIC 
7.2 and released as INMAGIC PLUS, including capacity for image manage- 
ment.!” It is not an Optical Character Recognition package, but images can be 
stored, displayed and printed. The company also produces INMAGIC PLUS 
for Libraries and INMAGIC PLUS for Law Firms, incorporating predefined 
database definitions and report formats. 

Like a CDS/ISIS database, an INMAGIC database consists of records that 
contain a collection of fields. File types used are listed in Table 2, where 
XXXXXX represents database name. The .DIC and .DTM files must be synchro- 
nised with the .DAT to avoid database errors. The data structure file .STR must 
be created prior to database creation. Users communicate with INMAGIC 
through a menu hierarchy. 

DB/TextWorks, which is a software package for Windows-based PCs and 
networks, was introduced in 1995. Inmagic, Inc. refers to DB/TextWorks as 
textbase software with integrated image management for Windows.'* It has much 
in common with the INMAGIC PLUS, but with additional features such as: 


—  relational-like linking, enabling common data in one textbase to be ref- 
erenced by other data; 

— drag and drop WYSIWYG (What You See Is What You Get) report 
writer design; 
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Table 3. INMAGIC files 

File name Notes 

XXXXxx.STR Data structure; more than one database may use a structure 

XXXxxx.DAT Records and their indexes 

XXxxxx.DIC Directory of the records within the .DAT file; if the .DIC file is 
lost, the record in the database cannot be retrieved 

XXxxxx.DTM List of database changes made but not yet indexed.; must be 
copied when .DAT is 

Xxxxxx.DQM Quick index of records within .DTM; must be copied when 
.DAT is 

XXxxxx. FMO Report definition file 

XXXxxx. FMS Report format definition form; written out to a file for printing 
or editing 

XXXxxx.SMU Set of search prompts and a record display format available for 
use with a database 

XXXxxx.SRy User work file; provides a place for INMAGIC to store 
searches, as well as workspace for sorts; last character ‘y’ is a 
work file ID code 


— report printing fully enabled with Windows fonts; 

— image management capabilities supporting many standard file formats, 
including TIFF, JPEG, and BMP; 

— customising user interaction with databases with menu screen designer; 

— search indexes may be produced for all fields and 250 fields are permit- 
ted per database; 

— more field types including text, numbers, date, are permitted; 

— Internet publishing is facilitated by the use of the HTML Write-to-file 
feature to publish reports created in DB/TextWorks. 


When a textbase is created, DB/TextWorks generates a number of files, each 
of which have the same name as the textbase, but a different extension as in 
Table 4. 


3. DOS software evaluation 
3.1 Criteria 


Several approaches have been taken to evaluation of information management 
systems. Some focus upon analysis of information retrieval features of the 
search interface! and include search functions, user assistance and output 
flexibility. Others look more broadly at the software packages in their environ- 
ment and include criteria such as input and maintenance of data, indexing of 
stored information, interactive searching, output features, use of the program, 
special versions and security, and technical requirements of software.*?! We 
have used this broader approach. However, since the evaluation was carried 
out using small test collection databases of about 50 records on the subject of 
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Table 4. Textbase files 

Extension Description 

TBA Primary textbase definition file, also contains elements stored in the 
textbase 

ACF Access control file, controls simultaneous access by multiple users 

DBS Textbase structure file, contains field definitions 

IXL Indexed list file, contains the validation and substitution lists 

DBR Contains the records (including deferred new, deleted, or changed 
records) 

DBO Contains a directory to the records in the .DBR file 

SDO Contains a directory to records with deferred updates in the .DDBR 
file 

BTX Contains the Term and Word indexes for the fields 

occ Contains the lists of records indexed by the terms and words in the 
.BTX file 

LOG Textbase log file, lists changes to the textbase structure and records 

HLP Optional textbase-specific Help file 

INI Optional file used with the Copy Special feature 


information policy, our emphasis is on a features analysis of input, output and 
information retrieval facilities. 


3.2 Comparison 


We evaluated CDS/ISIS 3.07 and INMAGIC PLUS 1.0. Minimal RAM for 
CDS/ISIS is 640 Kbyte and for INMAGIC is 500 Kbyte. Hard disc space per 
program is 2 Mbyte and 1.5 Mbyte respectively. 


a) System limitations 
INMAGIC has no limitation on the record and field length, enabling storage of 
complete structured large documents. CDS/ISIS allows all fields to be indexed. 
The 200 lines of FST indicates that all fields can be indexed — each line may 
have more than one field. Limitations are summarised in Table 5. 

An INMAGIC database has one default data entry worksheet, while a 
CDS/ISIS database may have many. Each CDS/ISIS data entry worksheet may 
have several different pages. 


b) Database definition 
For CDS/ISIS, definition involves four steps: 


— Creating Field Definition Table 
— Creating data entry worksheet 

— Creating Field Select Table 

— Writing a display or print format. 


CDS/ISIS does not automatically create a default data entry worksheet based 
on the existing FDT. INMAGIC will automatically create a default data entry 
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Table 5. System limitations 
Characteristics CDSAISIS 3.07 INMAGIC PLUS 
Maximum number of databases unlimited unlimited 
Maximum number of 16,000,000 unlimited 
records/database 
Characters per record 8,000 characters unlimited 
Fields per record 200 75 
Characters per field 8,000 characters unlimited 
Maximum number of indexed fields 200 lines 50 
Fields per data entry 19 fields not applied 
worksheet page 
Maximum number of 20 pages not applied 
pages/worksheet 
Maximum size of a display formats 4,000 not applied 
Maximum number of stopwords 799 words Total length of stopword 
list including space 
<= 255 
Maximum length of indexed term 35 characters 59 characters 





worksheet and display format provided that the user has created a structure. 
Part of an example is shown in Figure 1. 

When there is a need for data structure modification, CDS/ISIS allows addi- 
tion of fields anywhere in the FDT. With INMAGIC, once the data structure 
has been saved the field order cannot be rearranged nor can fields be added 
except at the end of the structure. In CDS/ISIS, fields can be deleted without 
loss of data as they remain in the master file. 


c) Updates 

Although CDS/ISIS checks consistency of other database structure files with 
the changes made to FDT, it informs the user of the need for modification 
which must be carried out. If the indexing code or order key in an INMAGIC 
data structure is changed the Build or Remove options on all databases using 
that structure must be used. In the case of CDS/ISIS, in order for changes made 
to FST to have effect on the whole inverted file, the file must be regenerated. 


d) Copying data structure files 

INMAGIC allows many databases to share the same data structure file. A new 
data file may be generated using the existing data structure. In the case of 
CDS/ISIS, each database must have its own FDT, even when it is identical 
with other structures. There is no facility for directly copying the existing 
database definition files within the CDS/ISIS, except through DOS. CDS/ISIS, 
however, allows the users to create different customised data entry worksheets 
for databases, each of which may contain different fields, whereas INMAGIC 
has only one default data entry worksheet which is created automatically by the 
system. CDS/ISIS also allows copying from existing FST and print formats. 
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Type LABEL NAME INDEX SORT EMPHASIS 
for each field 

F/1 ID RECID y: 2 1 

F/2 TI TITLE Y 5 1 

F/3 TO TIT_ORIG N 

F/4 AU AUTHOR Y 5 1 

F/5 CO CORPORATE Y 5 1 

F/6 JO JOURNAL T 5 1 

F/7 SO SOURCE T 2 1 

F/8 SE SERIES T 5 1 

F/9 URL INTERNET T 5 1 

F/10 CF CONFERENCE T 5 1 

F/11 PY PUB_YEAR T 1 1 

F/12 PD PUB_DATE N 

F/13 VOL VOLUME N 

F/14 NO NUMBER N 

F/15 PG PAGE N 

F/16 LA LANGUAGE Y 5 1 

F/17 DT DOC_TYPE T 1 1 

F/18 AB ABSTRACT K 1 1 

F/19 DE DESCRIPTOR Y 5 1 








Figure 1. Example INMAGIC data structure 


e) Creating auxiliary files 

Auxiliary files include files for stopwords, the ‘any’ file, validation, and search 
prompts. INMAGIC has 14 default stopwords to which additions may be 
made. INMAGIC has a standard option for creating validation files and search 
prompts. A validation file is not used by standard CDS/ISIS and the stopwords 
file and ‘any’ file must be created by text editor programs. 


f) Security 

CDS/ISIS can exercise the use of password and security to some extent. It does 
not provide user internal facility to define passwords to restrict access to cer- 
tain functions or files. However, SYSPAR.PAR can be used to create ‘pass- 
words’ that redirect users to particular menus, worksheets, or display formats 
which hide certain functions or fields. INMAGIC allows users to control 
master password for system manager and passwords for users with access to 
certain fields. 


g) Data maintenance 

Table 6 summarises the comparison of data input features. Both packages have 
provision for batch input — ASCH files in the case of INMAGIC, ISO2709 
format in the case of CDS/ISIS. Because CDS/ISIS is developed in relation to 
the standard, it is able to make use of the delimiters and subfield elements that 
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Table 6. Data input and maintenance 








Characteristics CDS/ISIS INMAGIC 
Direct input from keyboard yes yes 
Screen editor included or separate built-in built-in 
word processor 
Editing and deleting existing records yes yes 
Acceptance of data in extended of 8-bit yes yes 
ASCII code 
Data to be entered in a record structure with yes yes 
separate fields 
Availability of predefined record structures yes yes 
Creating a permanent default settings template yes yes 
Creating temporary default setting template yes yes 
Field contents to be checked for type of data no yes 
or against authorised list 
Compulsory input option for certain fields yes yes 
Direct input from other sources without no no 
conversion 
Global editing of information in specified no yes 
fields or common information 
Flexibility of batch input format ISO 2709 from a text file 





are embodied in its structure. It can therefore accommodate implementations 
of ISO 2709 such as MARC and the CCF (Common Communications Format). 

In a CDSASIS database, fields can be deleted without causing damage to 
existing master files before reorganisation of the master file or export into an 
ISO 2709 file, but data cannot be edited in the deleted fields. CDS/ISIS also 
allows field attributes (length, pattern, type) to be changed without damaging 
an existing database, though if fields are shortened then records containing 
longer fields cannot be edited without restoration of the original length. This 
problem is not present with INMAGIC because it does not have field length 
limit or definition. INMAGIC does not provide internal backup facilities. It 
uses DOS BACKUP and RESTORE. CDS/ISIS, however, has its own backup 
and restore utilities. These work only on the master file. The database structure 
file must be copied separately. 


h) Index creation 

Both packages permit a variety of indexing methods. INMAGIC enables either 
keyword or term indexing or the two combined. CDS/ISIS has nine options, 
based upon complete fields, subfields, or sequences defined by indicators. 
Table 7 lists the features. 


i) Retrieval 

Both packages provide most standard information retrieval features, although 
the implementation differs. For example, searching through indexes requires 
an indication of index in INMAGIC, but CDS/ISIS defaults to all fields. 
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Table 7. Indexing features 








Indexing characteristics CDS/ISIS INMAGIC 
Maximum number of characters indexed/term 30 59 
Maximum number of fields indexed per database 200 50 
Indexing techniques/methods 9 3 

Indexing whole field yes yes 
Indexing every word of the field yes yes 
Indexing phrase of compound terms yes no 
Stopword list yes yes 
Deferred indexing yes yes 
Maximal number of indexes 1 100 





Boolean searching in CDS/ISIS takes effect on an entire record, whereas with 
INMAGIC it may be specified within or between fields. For proximity search- 
ing INMAGIC permits specification of words in same order, or regardless of 
order. CDS/ISIS uses the period symbol to indicate that there are at most n-1 
intervening words (A.B means adjacent, A...B means within at most two 
words of each other). It uses the dollar symbol to indicate an exact number of 
intervening words (A$$B means exactly one intervening word). A comparison 
is shown in Table 8. 

Indexes generated by CDS/ISIS and INMAGIC can be printed. CDS/ISIS, 
however, generates printed indexes only in a file with predefined name 
(IFLIST.LST). It does not send the index to a printer. The lists may either be 
dumped (including the address of the term in the inverted file), or produced 
with only the order number, occurrences associated with the term, and the 
term itself. 

INMAGIC does not have a thesaurus searching facility and CDS/ISIS does 
not have a standard option for thesaurus searching. However, it has a special 
PASCAL program for thesaurus searching distributed together with the pro- 
gram. This module allows the users to use a thesaurus for selecting terms for 
creating a search expression. The thesaurus database is a separate database 
(THES) which contains terms and information about their relationship. 


j) Output 
Features for presentation of retrieved results are similar for the two packages, 
as shown in Table 9. Each requires use of commands for data display. They do 
not automatically display results if the search finds only a single record. 
INMAGIC has unformatted and formatted options for display. In unformatted 
display, one record a time is presented, and users can use arrow keys to move 
to a specific part of the displayed record. CDS/ISIS allows display of the latest 
search result only. It displays one screen page a time and the user can only go 
forwards by <ENTER> key. No backwards key is available. Selection of a spe- 
cific record in a search result isn’t allowed. 

Both packages permit customised formats. Some of the major differences in 
display format designing between the two packages are: 
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Table 8. Search facility comparison 
Feature CDS/ISIS INMAGIC 
Index searching yes yes 
Retrieval of non-indexed yes yes 
information (free-text searching) (command-line 
mode only) 
Index display yes yes 
Occurrences Occurrences shown 
not shown 
Move to search from selected yes yes 
index terms Can combine function 
keys (*, +, F, G,..) 
when selecting term 
to help create search 
expression 
Search specific fields yes yes 
Use field identifiers, Defined in search 
/(n) statement 
Default is record 
Boolean operators yes yes 
(AND, OR, NOT) * (AND), + (OR), & (AND), ,/ (OR), 
^ (NOT) apply to &- (NOT) 
complete record apply within fields 
Nested expressions yes yes 
with parentheses 
Set retention and back reference yes yes 
Proximity operations yes yes 


Choice between 
exact number or 
‘at least’ 


In order Pn, otherwise 
Wn 


Truncation (prefix only) yes($) yes(*) 
Ranging limited yes 
use VAL function 
in free text 
Save strategy no yes 





— CDSASIS does not support column display for tabular reports; 

— an INMAGIC report format contains five parts including page defini- 
tions, user question definitions, calculation definitions, page layout, and 
record layout, while CDS/ISIS display format contains only record 
layout information, other specifications for page definition and page 
layout are stored in the system and used just before printing; 


— CDSASIS does not provide for calculations. 


Because of the separation of record display information and page layout in 
CDS/ISIS, the screen display of information may differ from the actual output 
to a file or a printer. Both packages provide default display formats, however 
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Table 9. Output features 








Features CDS/ISIS INMAGIC 
Standard output formats no no 
Except Libraries 
and Law 
versions 
User-defined output formats yes yes 
Selection of specific fields for display or printing yes yes 
Sorting of search results or records yes yes 
with descending 
Highlighting of search terms no yes 
Formats for exchange with other programs no no 
Output of reference list according to journal styles no no 





in INMAGIC, the default format is always present, while the CDS/ISIS ALL 
format is used only if the present format is an empty one (no commands), or no 
format is selected. 

Sorting and subsorting procedures are available in each package, with 
descending sorts included in INMAGIC. In CDSAISIS the sort result can be 
loaded into a separate database which has fields corresponding to sort keys. 
The sorting parameters in CDS/ISIS can be entered directly or can be prede- 
fined by creating user-defined print worksheets. The worksheets facilitate the 
sorting procedures as they save the data entry when sorting or printing. 
However, users must be aware of their existence and know the names as the 
system does not prompt or let them know about the worksheets. Another dis- 
advantage of sorting is that the sorted results cannot be displayed within 
CDS/ISIS. You can have a look at its content only outside CDS/ISIS using 
word-processing software 


k) Assistance for users 


User assistance normally takes the form of print manuals and online help. Both 
packages are accompanied by this information. For CDS/ISIS there is an offi- 
cial manual published by Unesco” and a handbook by Buxton and 
Hopkinson.” The updates for version 3.0 and 3.07 are present in the README 
file accompanied with the distribution of the program or are described in vari- 
ous numbers of the ASTINFO Newsletter.” The manual is well organised with 
contents and index. Some countries (e.g. Vietnam?) also publish the manual in 
their national languages, based on the original. There is also a self-instruction 
package TEPACIS, for CDS/ISIS.** INMAGIC PLUS is distributed with com- 
prehensive manuals?’?*° written in a manner that is understandable by inexpe- 
rienced users. 

There is no facility for listing databases accessible for use in CDS/ISIS, 
which presents difficulties for those who cannot respond to the prompt 
Database name. This also applies when the user wants to select items such as 
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display/print formats, data entry worksheets, user print worksheets, save 
files. INMAGIC gives users more assistance. For example a database can be 
selected more easily because the database names in the current directory are 
listed, or the address can be specified to look for a database list. The same 
assistance is available when selecting report or display formats or saved search 
set number. 

The Help option is available in CDS/ISIS but is not comprehensive. It is 
available for data entry or entering data or parameters in the system work- 
sheets. Nowhere in the online system is there help about how to write a search 
expression, how to use operators, what symbols are used to express the opera- 
tors, how to select terms from the term dictionary, or what function keys may 
be used. Most of this material is present in the Manual. INMAGIC provides 
much more extensive online help. 

Both packages provide users with clear current status information, such as 
the number of records, and current display format. 


1) Error messages 


Buxton and Trenner® have noted that error messages are an important part of 
an interactive system and have significant effect on user’s behaviour. Perera’ 
concluded that the error messages in CDS/ISIS are not very user friendly. 
Many error messages are cryptic and difficult to comprehend even by experi- 
enced users. CDS/ISIS also produces ‘terminal errors’ — these are errors not 
detected and recognised directly by CDS/ISIS, such as when a storage device 
is out of space. They are detected by the PASCAL run-time error system, 
which terminates the program. In the evaluation INMAGIC was found to have 
more user-friendly error messages*”. 


m) Program development by users 


This is a strong point of CDS/ISIS whereas INMAGIC is a ready-to-use and 
closed package and users cannot change its interface, CDS/ISIS is like an open 
system providing users with programming facilities such as the thesaurus 
searching module (THES.PAS) to develop different modules to be run in CDS/ 
ISIS. Many different modules have been developed to enhance CDS/ISIS. For 
example, Chowdhury et al’! developed a module for online vocabulary control, 
and the HERISKO interface has been developed for CD-ROM search.*? 


4. Windows software evaluation 


WINISIS is the Windows version of CDS/ISIS (at time of evaluation in Beta 
test mode, a number of features were still to be implemented). DB/TextWorks 
is referred to as ‘other category’ software by Inmagic, Inc. and has been a com- 
mercial product since 1995. Some features of the packages are similar to their 
DOS predecessors, while others are new, improved or enhanced. In this section 
we concentrate mostly on the new or enhanced features.Both packages run 
under either Microsoft Windows 3.X, or Windows 95. DB/TextWorks can be 
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installed for network with Windows NT. WINISIS does not support long file 
and directory name — CDS/ISIS classic six-letters names for database names 
must be used. 

DB/TextWorks can handle more fields and indexes than DOS INMAGIC 
PLUS. For example, the maximum number of fields is 250, each of which may 
be indexed. Thirty different types of image formats are supported with graphi- 
cal output, facilitating display of stored photographs, line drawings, docu- 
ments, permits, maps, and other visual information. WINISIS does not handle 
images, but it has linking features that allow users to run an application outside 
WINISIS using information from the database. 


a) DOS to Windows Conversion 


WINISIS does not require conversion of data from a DOS CDS/ISIS database. 
All database files can be copied to the WINISIS database directory or location 
can be indicated by specification in the SYSPAR.PAR file. DOS CDS/ISIS can 
use WINISIS databases, though WINISIS display formats for graphics are 
incompatible. DB/TextWorks textbases are incompatible with INMAGIC 
PLUS databases. The conversion process however, is straightforward and can 
be implemented with standard tools provided. 


b) Input 


Field characteristics (text, alphabetic, pattern, numeric) for WINISIS are the 
same as CDS/ISIS, but DB/TextWorks supports an expanded range as indi- 
cated in Table 10. 

Both packages support keyboard and batch input. WINISIS, like its DOS 
version, accepts batch input from file in ISO 2709 format only. DB/TextWorks 
accepts INMAGIC tagged format exported from the DOS version, and 
Delimited ASCII format, which uses symbols to indicate record and field 
delimiter. Global editing is supported in both packages. Two other input 
improvements in WINISIS are: 


— Term Dictionary: on input, users activate, and drag-drop terms from it 
into the data entry worksheet, thereby improving consistency of data in 
the database; 

— a basic validation check on the field format and information is carried 
out from a user-defined file. 


The use of files containing authorised terms for inputting is not implemented in 
WINISIS. This facility is available in DB/TextWorks. Both packages enable 
templates for data entry (in DB/TextWorks, called Record Skeleton). WINISIS 
permits copying of information in existing record to new record. This feature 
is not present in the DOS version. 


c) Reorganisation of data file 


WINISIS retains its features from the DOS version, allowing users to reorder 
fields, change field names (but not tags), and change field attitude. 
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Table 10. DB/TextWorks field types 





Field Type 


Fields contain 





Text 


Number 
Date 


Automatic no. 
Automatic Date 


Computed no. 


Computed Date 


Any text or combination of alphanumerics, punctuation, 
spaces, and line breaks 

A series of digits that may be calculated or sorted as a value 
Dates, in any of the many formats that DB/TextWorks 
recognises 

A unique number generated by DB/TextWorks 

The current date or current date and time 

Automatically generated value derived from user-specified 
formula 

An automatically generated date derived from user-specified 
formula 


Image The name(s) of associated image file(s) 

Link Text matched against a corresponding field in another 
textbase for association 

Code Items in which case and punctuation should be considered 
significant during searches and sorts, such as chemical 
names or scientific formulas 

UDC Universal Decimal Classification numbers 





DB/TextWorks improves on INMAGIC PLUS by allowing reorder of fields if 
textbases are empty. 


d) Index creation 

DB/TextWorks can index all fields in a textbase so there may be up to 500 
(given that there may be two indexes — Term and Word, per field). It retains 
INMAGIC PLUS options. WINISIS reduces to five options (from the nine of 
DOS) because field identifiers can be displayed, eliminating the need for pre- 
fixes (such as AU=, TI=). 


e) Retrieval 
Facilities summarised in Table 8 continue to be available in the Windows ver- 
sion of the packages, however several enhancements have been implemented: 


— Index displays: term occurrences now appear in WINISIS displays, and 
the display may be limited to specified field subsets. 

— Search mode: WINISIS now permits an assisted mode as well 
(DB/TextWorks continues the DOS command query and Query-by- 
Example options). 

— Specified field searches in DB/TextWorks QBE forms can be cus- 
tomised by the users. One QBE can accept up to 4,000 characters. In 
selection of WINISIS fields to be searched, field identifiers are added to 
a term by consulting a Window that shows searchable fields. 

— Search history displays in a WINISIS window and clicking an element 
of the history will display results. 
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f) Output 
Features shown in Table 9 for DOS versions continue to apply together with 
enhancements as follows: 


— Windows fonts are present for each package and a WYSIWYG is imple- 
mented. DB/TextWorks form designer provides a Drag/Drop facility 
for ease of forms design. The use of fonts and font sizes is also possible 
in WINISIS, but the user is required to use commands to write the dis- 
play or print formats and to define the font table. 

— Search results display is improved in WINISIS by being able to click 
elements of search history. Any record in a set may be marked for later 
printing. DB/TextWorks records are marked for omission rather than 
printing, and search sets must be saved explicitly for later display. 

— Print previews are available with each package, and WINISIS allows 
change of predefined formats immediately before printing. Printing may 
then be directed to file or printer (including PostScript) as desired. 

— Hypertext linking is enabled in WINISIS but not DB/TextWorks. For 
example in WINISIS it can be used to link a record containing descrip- 
tive information about a picture with the picture file. The picture can be 
displayed by clicking on the link, which will execute image software for 
display. If a record has a link to an Internet document, the clicking on 
the link activates an Internet browser. 


g) Assistance for users 

The commercial product DB/TextWorks is distributed with a comprehensive 
well-organised manual.*? Sample databases are also distributed with the soft- 
ware. WINISIS being still in its test phase, did not have a printed manual. Files 
(README.WRI, and WINISIS.HLP) containing basic instructions are distrib- 
uted with the package. The CDS/ISIS 2.3 Manual” continues to be useful for 
the Windows version. DB/TextWorks has an extensive online hypertextual 
help. WINISIS also has useful help file with hypertextual links, but at the time 
of testing, it was not comprehensive. WINISIS now allows the users to select 
databases, data entry worksheets, display formats, etc. from a window that list 
available files. In general, Windows versions of the two products are more user- 
friendly, and provide users with more tools and easier interaction. WINISIS is 
considerably improved in comparison with the DOS version CDS/ISIS 3.07. 


5. Conclusion 


The evaluation indicates that the packages are powerful software for structured 
textual information management. They have many similarities in their basic 
features while having a number of distinguishing features. Features common to 
both versions of the packages are: 


— suitability for the management of structured textual information with 
variable record length; 
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— users define their own databases; 

— powerful search facility, with use of boolean and proximity operators 
allowing flexible and diverse search combination; 

— customisation of output formats, especially formats that are specific and 
characteristic for library and information work like indexes and sorts; 

— editor capacity allowing users to enter new data and edit existing data. 


Distinguishing features include: 


— InMagic products have more extensive online help and are more user- 
friendly, particularly when comparing DOS versions; 

— InMagic products handle more data types, which make them more flex- 
ible to use for purposes (e.g., administrative) other than conventional 
bibliographic databases; 

— some searching features are more powerful than that in CDSAISIS, for 
example range search and comparison; 

— InMagic products can produce reports in tabular formats that are not 
available with standard CDS/ISIS systems; 

— CDSAISIS and WINISIS have strong and diverse indexing techniques 
that can meet various requirement and a programming facility enabling 
customisation to meet specific needs including interfaces; 

— CDSASIS and WINISIS have language versions other than English for 
interfaces; 

— CDSAISIS and WINISIS accommodate international standard biblio- 
graphic transfer format; 

— the Thesaurus searching module written with CDS/ISIS Pascal gives the 
software the ability to use a Thesaurus for information searching. 


InMagic software is easier for novices to use and provides more assistance as 
well as application-specific versions. It is readily adaptable to non-bibliographic 
application, and for applications that require frequent amendment and update. 

CDS/ISIS and WINISIS provide comparable search and report facilities, 
and have the advantages of permitting customised language interfaces, and 
being freely available and supported by Unesco. The DOS version has seen 
wide implementation in developing countries, and it is to be expected that the 
Windows version will follow suit. 

For systems that have relatively similar basic features, it may seem obvious 
to choose the one that is free. However, account should also be taken of the 
sophistication of the potential users, and the cost of the length of the learning 
curve. There will be circumstances where it makes economic sense to make 
use of INMAGIC’s software because of the support provided, and where most 
required functions are already installed. WINISIS is much more dependent on 
self-help, but because it is developed within a standards framework and 
because it is extendable with PASCAL programs, it rewards time that can 
be invested. 
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